
13 research outputs found

NEURAL NAMED ENTITY RECOGNITION AND TEMPORAL RELATION EXTRACTION

Author: Ju Meizhi
Publication date: 01/08/2020

The University of Manchester - Institutional Repository

A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience

Author: B Bhasuran
C O’Reilly
Christian O’Reilly
CJ Crasto
DLK Yamins
E Underwood
Elisabetta Iavarone
H Pan
H-M Müller
I Spasic
John McNaught
K Ambert
L French
M Habibi
MA Driel Van
Maolin Li
Matthew Shardlow
Meizhi Ju
N Okazaki
PF Balan
R Richardet
S Hochreiter
S Tokui
S Tripathy
Sophia Ananiadou
The UniProt Consortium
X Vasques
Y Chen
Y Lecun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 15/11/2018

The curation of neuroscience entities is crucial to ongoing efforts in neuroinformatics and computational neuroscience, such as those being deployed in the context of continuing large-scale brain modelling projects. However, manually sifting through thousands of articles for new information about modelled entities is a painstaking and low-reward task. Text mining can be used to help a curator extract relevant information from this literature in a systematic way. We propose the application of text mining methods to the neuroscience literature. Specifically, two computational neuroscientists annotated a corpus of entities pertinent to neuroscience using active learning techniques to enable swift, targeted annotation. We then trained machine learning models to recognise the entities that have been identified. The entities covered are Neuron Types, Brain Regions, Experimental Values, Units, Ion Currents, Channels, and Conductances, and Model Organisms. We tested a traditional rule-based approach, a conditional random field and a deep learning model for named entity recognition, finding that the deep learning model was superior. Our final results show that we can detect a range of named entities of interest to the neuroscientist with a macro-averaged precision, recall and F1 score of 0.866, 0.817 and 0.837, respectively. The contributions of this work are as follows: 1) We provide a set of Named Entity Recognition (NER) tools that are capable of detecting neuroscience entities with performance above or similar to prior work. 2) We propose a methodology for training NER tools for neuroscience that requires very little training data to achieve strong performance. This can be adapted for any sub-domain within neuroscience. 3) We provide a small corpus with annotations for multiple entity types, as well as annotation guidelines to help others reproduce our experiments.
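The macro-averaged scores quoted in the abstract above can be illustrated with a minimal sketch. It assumes the usual convention of computing precision, recall and F1 per entity type and then taking the unweighted mean over types; the abstract does not spell out its averaging procedure. The function names and the counts are hypothetical and are not taken from the paper. Under this convention the macro F1 need not equal the harmonic mean of the macro precision and macro recall, which would be consistent with the reported 0.837 lying slightly below the harmonic mean of 0.866 and 0.817 (about 0.841).

```python
from typing import Dict, Tuple

# (true positives, false positives, false negatives) for one entity type
Counts = Tuple[int, int, int]


def prf(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Precision, recall and F1 for one entity type; 0.0 where undefined."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def macro_average(counts: Dict[str, Counts]) -> Tuple[float, float, float]:
    """Unweighted mean of per-type precision, recall and F1."""
    if not counts:
        raise ValueError("at least one entity type is required")
    scores = [prf(*c) for c in counts.values()]
    n = len(scores)
    precision = sum(s[0] for s in scores) / n
    recall = sum(s[1] for s in scores) / n
    f1 = sum(s[2] for s in scores) / n
    return precision, recall, f1


# Hypothetical counts for two entity types (illustration only, not paper data).
example_counts = {"Brain Region": (8, 2, 2), "Neuron Type": (3, 1, 3)}
macro_p, macro_r, macro_f1 = macro_average(example_counts)
print(f"macro P={macro_p:.3f} R={macro_r:.3f} F1={macro_f1:.3f}")
```

Because every entity type carries equal weight, a rare type with poor recall pulls the macro scores down as much as a frequent one would, which is why macro averaging is a common choice when entity types are imbalanced.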

Infoscience - École polytechnique fédérale de Lausanne

Crossref

E-space: Manchester Metropolitan University's Research Repository

ZENODO

The University of Manchester - Institutional Repository

A Neural Layered Model for Nested Named Entity Recognition

Author: Ananiadou Sophia
Ju Meizhi
Miwa Makoto
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/06/2018

The University of Manchester - Institutional Repository

Improving reference prioritisation with PICO recognition

Author: Ananiadou Sophia
Brockmeier Austin
Ju Meizhi
Przybyla Piotr
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 06/12/2019

The University of Manchester - Institutional Repository

An Ensemble of Neural Models for Nested Adverse Drug Events and Medication Extraction with Subwords

Author: Ananiadou Sophia
Ju Meizhi
Miwa Makoto
Nguyen Nhung
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2020

The University of Manchester - Institutional Repository

Comparing Neural Models for Nested and Overlapping Biomedical Event Detection

Author: Ananiadou Sophia
Christopoulou Fenia
Espinosa Kurt Junshean
Georgiadis Panagiotis
Ju Meizhi
Miwa Makoto
Publication date: 05/05/2022

BACKGROUND: Nested and overlapping events are particularly frequent and informative structures in biomedical event extraction. However, state-of-the-art neural models either neglect those structures during learning or use syntactic features and external tools to detect them. To overcome these limitations, this paper presents and compares two neural models: a novel EXhaustive Neural Network (EXNN) and a Search-Based Neural Network (SBNN) for detection of nested and overlapping events. RESULTS: We evaluate the proposed models as an event detection component in isolation and within a pipeline setting. Evaluation on several annotated biomedical event extraction datasets shows that both EXNN and SBNN achieve higher performance in detecting nested and overlapping events, compared to the state-of-the-art model Turku Event Extraction System (TEES). CONCLUSIONS: The experimental results reveal that both EXNN and SBNN are effective for biomedical event extraction. Furthermore, results in a pipeline setting indicate that our models improve detection of events compared to models that use either gold or predicted named entities.

PubMed Central

The University of Manchester - Institutional Repository

COPD Corpus

Author: Ananiadou Sophia
Bakerly Nawar
Gkoutos Georgios
Ju Meizhi
Short Andrea
Thompson Paul
Tsaprouni Loukia
Publication date: 20/06/2019

The COPD corpus is a semantically annotated corpus, focussed on phenotypic information, consisting of 30 full-text articles. The corpus has been manually annotated with named entities, using a fine-grained annotation scheme, which aims to capture detailed information about COPD phenotypes. In particular, the annotations may be "nested" within each other. This takes into account the potentially complex and nested nature of phenotype descriptions, which may include mentions of various other types of concepts within them.

Dryad Digital Repository (Duke University)

Data from: Annotating and detecting phenotypic information for chronic obstructive pulmonary disease

Author: Ananiadou Sophia
Bakerly Nawar Diar
Gkoutos Georgios V.
Ju Meizhi
Short Andrea D.
Thompson Paul
Tsaprouni Loukia
Publication date: 26/04/2019

Objectives: Chronic obstructive pulmonary disease (COPD) phenotypes cover a range of lung abnormalities. To allow text mining methods to identify pertinent and potentially complex information about these phenotypes from textual data, we have developed a novel annotated corpus, which we use to train a neural network-based named entity recognizer to detect fine-grained COPD phenotypic information. Materials and methods: Since COPD phenotype descriptions often mention other concepts within them (proteins, treatments, etc.), our corpus annotations include both outermost phenotype descriptions and concepts nested within them. Our neural layered bidirectional long short-term memory conditional random field (BiLSTM-CRF) network first recognizes nested mentions, which are fed into subsequent BiLSTM-CRF layers to help recognize enclosing phenotype mentions. Results: Our corpus of 30 full papers (available at: http://www.nactem.ac.uk/COPD) is annotated by experts with 27 030 phenotype-related concept mentions, most of which are automatically linked to UMLS Metathesaurus concepts. When trained using the corpus, our BiLSTM-CRF network outperforms other popular approaches in recognizing detailed phenotypic information. Discussion: Information extracted by our method can facilitate efficient location and exploration of detailed information about phenotypes, for example, those specifically concerning reactions to treatments. Conclusion: The importance of our corpus for developing methods to extract fine-grained information about COPD phenotypes is demonstrated through its successful use to train a layered BiLSTM-CRF network to extract phenotypic information at various levels of granularity. The minimal human intervention needed for training should permit ready adaptation to extracting phenotypic information about other diseases.
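The inside-out layering described in the abstract above (inner mentions recognized first, then passed on so that later layers can find the enclosing mentions) can be sketched in a few lines. This is a toy illustration of the control flow only, not the authors' model: each BiLSTM-CRF layer is replaced by a hypothetical lexicon tagger, and a detected mention is handed to the next layer as a single unit carrying its label, which is an assumption made for this sketch. All names, patterns and the example phrase are invented.

```python
from typing import Callable, List, Sequence, Tuple

Span = Tuple[int, int, str]  # (start token, end token exclusive, label)
Tagger = Callable[[List[str]], List[Span]]  # spans are over unit indices


def lexicon_tagger(patterns: Sequence[Tuple[Tuple[str, ...], str]]) -> Tagger:
    """Stand-in for one trained layer: left-to-right exact pattern matching."""
    def tag(symbols: List[str]) -> List[Span]:
        spans, i = [], 0
        while i < len(symbols):
            for pattern, label in patterns:
                n = len(pattern)
                if tuple(symbols[i:i + n]) == pattern:
                    spans.append((i, i + n, label))
                    i += n
                    break
            else:
                i += 1
        return spans  # non-overlapping by construction
    return tag


def run_layers(tokens: List[str], taggers: Sequence[Tagger]) -> List[Span]:
    """Apply taggers inside-out; each detected mention becomes one unit."""
    # A unit is (start token, end token exclusive, symbol seen by the next layer).
    units = [(i, i + 1, tok) for i, tok in enumerate(tokens)]
    found: List[Span] = []
    for tagger in taggers:
        layer_spans = sorted(tagger([u[2] for u in units]))
        if not layer_spans:
            break  # nothing new detected, so stop stacking layers
        merged, pos = [], 0
        for u_start, u_end, label in layer_spans:
            merged.extend(units[pos:u_start])
            start, end = units[u_start][0], units[u_end - 1][1]
            found.append((start, end, label))
            merged.append((start, end, label))  # mention collapses to its label
            pos = u_end
        merged.extend(units[pos:])
        units = merged
    return found


# Hypothetical two-layer example: a protein nested inside a phenotype mention.
inner = lexicon_tagger([(("IL-6",), "PROTEIN")])
outer = lexicon_tagger([(("elevated", "PROTEIN", "levels"), "PHENOTYPE")])
mentions = run_layers(["elevated", "IL-6", "levels"], [inner, outer])
print(mentions)  # [(1, 2, 'PROTEIN'), (0, 3, 'PHENOTYPE')]
```

The point of the sketch is that the outer pattern only matches once the inner mention has been collapsed into a single labelled unit, which mirrors the dependency between layers that the abstract describes.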

ZENODO

Dryad Digital Repository (Duke University)

Electronic Archiving System

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Identification of natural products as modulators of OATP2B1 using LC-MS/MS to quantify OATP-mediated uptake

Author: Beecher GR
Chunshan Gui
Cui MY
Fengjiao Wen
Hongjian Zhang
Jialin Bian
Ju WZ
Kelly GS
Krahenbuhl S
Meizhi Shi
Publication venue: 'Informa UK Limited'

Crossref

An ensemble of neural models for nested adverse drug events and medication extraction with subwords

Author: Belousov
Boyer
Cho
Cocos
Dandala
Dang
Florez
Gu
He
Iqbal
Jensen
Jessop
Johnson
Ju
Korkontzelos
Lample
Lance
Li
Makoto Miwa
Meizhi Ju
Mikolov
Nhung T H Nguyen
Nikfarjam
Noreen
Roberts
Sennrich
Snoek
Sophia Ananiadou
Tiftikci
Tsuruoka
Velupillai
Wang
Wu
Wunnava
Xu
Yadav
Yang
Yeleswarapu
Publication venue: 'Oxford University Press (OUP)'

Crossref